
FIG. 1 

BLASTP - query = 147JTR1; Hit = swiss | 015393 |TMS2_HUMAN 

This hit is scoring at : 3e-66 (expectation value) 
Alignment length (overlap) : 370 
Identities : 38 h 

Scoring matrix : BLOSUM62 (used to infer consensus pattern) 
Database searched : nrdh 

Q: 36 CDGVVDCKLKSDELGCVRFDWDKSLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFBS 
CDGV .C. — UE CVR. .. :L::YS W P:C. . :WN: :Y. . . .C: . :G: : : 

H: 133 CDGVSHCPGGEDENRCVRLYGPNFILQMYSSQRKSWHPVCQDDWNENYGRAACRDMGYKN 

AHRTTEVAHRDFANSFSILRYNST IQESLHRSE CPSQRYISLQCSHCGLRA 

. ::: . D :.S S.::.N:: I.:.L:.S: C.S:..:SL:C CG:.. : 
NFYSSQGIVDD SGSTSFMKLNTSAGNVDIYKKLYHSDACSSKAWSLRCLACGVNLNSS 

MTGRIVGGALASDSKWPWQVSLHFGTTH EGGTLIDAQmTAAHCF FVTREKVLEG- - - 
. ...RIVGG. A ...WPWQVSLK. . .H:CGG: :1. . :W: : I'AAHC . EK L. . 
RQSRIVGGESALPGAWPWQVSLHVQNVHVCGGSIITPEWrTAAHCV EKPLNNPWH' 

TRYPSIN HIS 

WKVYAGTSNLHQLPEAAS- - IAEIIINSNYTDEEDDYDIALMRLSKH7TLSGEGICTP 
W..:AG. . . : — X — :.NY..:..: UIALH:L.KPLT. : . : :C.P 
WTAFAGILRQSFMFYGAGYQVQKVISHPNYDSKTKNNDIALMKLQKPLTPNDLVKPVCLP 

RSPAPQPQHPLQPSHLSASVNSYPGPKA 3VDKTSP FLRE VQVNLIDFKKCNDYLVY DSYL 

P LQP..L . G KTS..L...:V hi: ::CN. .VXD:.: 

N PGMMLQPEQIrCWISGWGATEEKGKTSEVLNAAKVLLIETQRCNSRYVYDNLI 

TPRMMCAGDLRGGRDSCQGDSGGPLVCEQNNRWYLAG VTSWGT GCGQRNKPGVYTKVTEV 
TP.M:CAG L:G. DSl'(jGUSGGPbV . . M W:L.G TSWU:Ul'. : . :PGVY .V. . 
TPAMICAGFLQGN^DSCQGDSGGPLVTSNNNIWWLIGDTSWGSGCAKAYRPGVYGNVMVF 

TRYPSIN SER 
LPWIYSKMEA 389 ~ 
..W1Y.:M:A 
TDWIYRQMKA 490 



FIG. 2 



Prosite search results 

PS00134 187->193TRYPSIN_HIS PDOC00124 
PS00135 334- >346 TRYPSIN SER PDOC00124 
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BLOCKS search results 

AC# Description Strength Score 

BL00495N Apple domain proteins. 1945 1582 

AA# 325 AGdlrGGrDsCqGDSGGPLVCeqNnRWyLaGvTSW (SEQ ID NO: 15) 

BL01253G Type I fibronectin domain proteins. 1641 1548 

AA# 332 rDsCQGDSGGPLVC (SEQ ID NO: 16) 

BL00134A Serine proteases, trypsin family, histidine p 1500 1524 

AA# 175 CGGTLIDAQWVLTAAHC (SEQ ID NO: 17) 

BL00021D Kringle domain proteins. 1556 1510 

AA# 341 GPLVCEQNNRWYLaGVTSWGtGCGQRNKPGVYTKVTevLPWI (SEQ ID NO: 18) 

BL01253H Type I fibronectin domain proteins. 1765 1508 

AA# 351 wYLaGvtSWGtGCGQRNKPGVYTKVTevLpWIysk (SEQ ID NO: 19) 

BL00021B Kringle domain proteins. 1547 1507 

AA# 175 CGGTLIDaQWVLTAAHCF (SEQ ID NO: 20) 

BL00495O Apple domain Droteins. 1756 1383 

AA# 360 GtGCGQRnkPGVYTKVtEvlpWIysKmeA (SEQ ID NO: 21) 

BL00134B Serine proteases, trypsin family, histidine p 1289 1299 

AA# 333 DSCQGDSGGPLVCEqNNRWYLAGV (SEQ ID NO: 22) 

BL01209 LDL-receptor class A (LDLRA) domain proteins. 1413 1274 

AA# 35 CDGWDCK1KSDE (SEQ ID NO: 23) 

BL01253F Type I fibronectin domain proteins. 1693 1270 

AA# 288 AdktSpFLREvQVnLidfkKCndylVYdSylTPrMmCAG (SEQ ID NO: 24) 

BL00495L Apple domain proteins. 1947 1263 

AA# 209 tSnlhqlpeaaSIaEIIInsNYtdeEddYDIALmrLskP (SEQ ID NO: 25) 

BL00134C Serine proteases, trypsin family, histidine p 1245 1254 

AA# 369 PGVYTKVTEVLPWI (SEQ ID NO: 26) 

BL01253D Type I fibronectin domain proteins. 1398 1217 

AA# 175 CGGtLIdaqWLTA (SEQ ID NO: 27) 
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